Real-time data access monitoring in distributed, multi-petabyte systems
Abstract
Petascale systems exist today and will become common in the next few years. Such systems are inevitably very complex, highly distributed, and heterogeneous. Monitoring a petascale system in real time and understanding its status at any given moment without impacting its performance is a highly intricate task. Common approaches and off-the-shelf tools either are unusable, do not scale, or severely impact the performance of the monitored servers. This paper describes unobtrusive monitoring software developed at Stanford Linear Accelerator Center (SLAC) for a highly distributed petascale production data set. The paper describes the employed solutions, the lessons learned, and the problems still to be addressed, and explains how the system can be reused elsewhere.
Similar resources
Designing a Multi-petabyte Database for LSST
The 3.2 giga-pixel LSST camera will produce approximately half a petabyte of archive images every month. These data need to be reduced in under a minute to produce real-time transient alerts, and then added to the cumulative catalog for further analysis. The catalog is expected to grow about three hundred terabytes per year. The data volume, the real-time transient alerting requirements of the ...
Evaluating the Efficiency of Firms with Negative Data in Multi-Period Systems: An Application to Bank Data
Data Envelopment Analysis (DEA) is a mathematical technique to evaluate the performance of firms with multiple inputs and outputs. In conventional DEA models, the efficiency scores of Decision Making Units (DMUs) with non-negative inputs and outputs are evaluated in a special period of time. However, in the real world there are situations wherein performance of firms must be evaluated in multip...
Design for Deterministic Monitoring
In order to test, or debug, a system we must observe its run-time behavior and deem how well the observations comply with the system requirements. There are two significant differences between debugging and testing of software for desktop computers and embedded real-time systems: (1) It is more difficult to observe embedded computer systems, simply because they are embedded, and that they thus ...
Error Modeling in Distribution Network State Estimation Using RBF-Based Artificial Neural Network
State estimation is essential to access observable network models for online monitoring and analyzing of power systems. Due to the integration of distributed energy resources and new technologies, state estimation in distribution systems would be necessary. However, accurate input data are essential for an accurate estimation along with knowledge on the possible correlation between the real and...
A Generic Model for Distributed Real-Time Scheduling Based on Dynamic Heterogeneous Data
In recent years multi-agent systems have demonstrated the power to handle complex real-time scheduling problems. They have also been used to integrate distributed information to provide access to heterogeneous data sources. Combining these two approaches to produce a multi-agent system that can use distributed heterogeneous data for real-time scheduling is a challenging problem. In this paper w...
Publication date: 2008